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Let n points be placed independently in d— dimensional space according 
to the standard d— dimensional normal distribution. Let dn be the longest 
edge length for the nearest neighbor graph on these points. We show that 
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1 Introduction and main results 

In this paper we prove a strong law result for the largest nearest neighbor distance of points 
distributed according to a standard normal distribution in ^'^. Throughout this paper we 
will assume that d > 2. 

Let Xi,X2, ■ ■ ■ be independent and identically distributed random variables distributed 
according to the standard multivariate normal distribution in 3? . Let (/>(x), a; G 3f? , denote 
the standard multivariate normal density, 

4>{x) = {2Tr)"^/^ exp{-\\xf /2), 

where || • || is the Euclidean {£2) norm on ^'^. Let R = \\X\\. Then the probability density 
function of R is given by 

Mr) = ^rfe-'^'/V-i, 0<r<oo,d>2 (LI) 

where A^ = ^^^^ — . 

The basic object of study will be the graphs G„ with vertex set Xn = {Xi,X2, . . . ,Xn}, 
n = 1, 2, . . . . Edges of G„ are formed by connecting each of the vertices in Xn to its nearest 
neighbor. The longest edge of the graph G„ is denoted by dn- We shall refer to G„ as the 
nearest neighbor graph (NNG) on Xn and to dn as the largest nearest neighbor distance 
(LNND). 

The largest nearest neighbor link has been studied in the context of computational geometry 
(see Dette and Henze (1989) and Steele and Tierney (1986)) and has applications in statis- 
tics, computer science, biology and the physical sciences. Appel and Russo (1997) proved 
strong law results for a graph on uniform points in the d— dimensional unit cube. Penrose 
(1999) extended this to general densities having compact support. Penrose (1998) proved a 



weak law result for dn on normally distributed points, which states that ^/(2Togr^jdn — bn 
converges weakly to the Gumbel distribution, where hn ^ [d — 1) log log n. a„ ~ 6„ implies 
that ttn/hn converges to a constant as n ^ cx). In what follows we will write log2 n for 
log log n. The above result is also shown to be true for the longest edge of the minimal 



spanning tree. We are not aware of strong law results for the LNND for graphs whose 
vertices are distributed according to densities with unbounded supports for d > 2. For a 
detailed description of Random Geometric Graphs, their properties and applications, we 
refer the reader to Penrose (2003) and references therin. 

It is often easier to study the graph G„ via the NNG P„ on the set Vn = {-^i, -^^21 • • • , Xj\f^}, 
where iV„ are Poisson random variables with mean n. Then Vn is an inhomogeneous Poisson 
point process with intensity function n(j){-). Note that the graphs G„ and P„ are coupled, 
since the first min(n, iV„) vertices of the two graphs are identical. We also assume that the 
random variables A^^^ are non-decreasing, so that Pi G P2 C P^- ■ ■ . We now state our main 
result. 

Theorem 1.1 Let dn be the LNND of the NNG Gn, defined on the collection Xn ofn points 
distributed independently and identically according to the standard normal distribution in 

R^, d>2. Then, 

Vlog n dn d 

lim — ; = —;=, a > 2, a.s. (1.2) 

n^oo iog2 n V2 



2 Proofs and supporting results 

For any a; G 3f? , let B{x,r) denote the open ball of radius r centered at x. Let 



I{x,r)= 4>{y)dy. (2.1) 

JB{x,r) 

For p > 0, define I{p, r) = I{pe, r), where e is the d— dimensional unit vector (1, 0, 0, . . . ,0). 
Due to the radial symmetry of (p{x), I{x, r) = /(||2;||, r). The following Lemma (see Penrose 
(2003), Lemma 8.8) provides large p asymptotics for I{p,r). 

Lemma 2.1 Let {pn)n>i cind (r„)„>i be sequences of positive numbers such that pn -^ cxd, 
r„ -^ 0, and rnPn -^ 00, as n ^ co. Then, 

I{pn,rn) ~ (2^)-i/V^exp(/)„r„ - /92)(p„r„)-('^+i)/^ as n ^ 00. (2.2) 



In order to prove strong law results for the LNND for graphs with densities having compact 
support, one covers the support of the density using an appropriate collection of concentric 
balls and then shows summability of certain events involving the distribution of the points 
of Xn on these balls. The results then follow by an application of the Borel-Cantelli Lemma. 
The asymptotic behaviour of the LNND depends on the (reciprocal of the) infimum of the 
density, since it is in the vicinity of this infimum, points will be sparse and hence be farthest 
from each other (see for example Penrose (1999)). In case of densities having unbounded 
support, the region to be covered must be determined first. The following Lemma gives us 
the regions of interest when the points are normally distributed. 

For any c G 3f? fixed, and large enough n, define 



ii„(c) = ^21ogn + (c + d-2)log2n + 21og(^d), (2.3) 

where A^ is as in (1.1). Let A'^ denote the complement of set A. Let Un{c) be the event X^ C 
B(0,Rn{c)) and Vn{c) denote the event that at least one point of A'„ lies in B'^{0,Rn{c)). 
o^n ~ bn implies that a„ > c„ for some sequence c„ and Cn ^ bn- 

Lemma 2.2 For c> 2, P[f/^(c) i.o. ] = 0, and for c < 0, P[V,%c) i.o. ] = a.s. The 
result is also true with Xn replaced by V\„ provided A„ ~ n. 

Thus for almost all realizations of the sequence {A'„}„>i, all points of Xn will lie within the 
ball B{0,Rn{c)) c > 2, eventually and for c < 0, there will be at least one point of Xn in 
S^(0,i?„(c)) eventually. 

Proof of Lemma 2.2. As i?„ -^ cx), note that 

1 - /(O, Rn) := / A,e-^"/2i?f 1 ~ AdRt'e-^-^^ (2.4) 

•J Rn 

Hence, 

PiKic)] = 1 - (/(O, Rn{cW ~ AduRnic)''-^ expi- R^cf / 2). (2.5) 



Let Uk be the subsequence a , with a > 1, and consider 



P[Un=\\U^{c)] < P[at least one vertex of Xn^^^ is in B%0, i?„, (c))] 
= l-(/(0,i?„,(c))r'=+^ 
~ A,nk+iRi-\c)e-''"-^'^''^/' ^k-'^/'. (2.6) 

Thus the above probabiHty is summable for c > 2 and the first part of Lemma 2.2 follows 
from the Borel-Cantelli Lemma. Again, using (2.4) and the inequality 1 — x < exp(— x), we 
get 

P[F„^(c)] = (/(0,i?„(c))" < expi-AanRt'e-^'r./^^. 

Let Uk be as above. 



< exp(-A,nfci?f;2^e-^"'=+i/') 

~ exp(- constant A;"'^/^), (2.7) 

which is summable for all c < 0. This proves the second part of Lemma 2.2. 
If Xn is replaced by Pa„ , then 

PlKic)] = exp(-A„(l-/(0,i?„(c)))) 

^ A„(l - AdRnicf'^ expi-Rnicf/2)) ~ nAdRnic^-^ exp(-i?„(c)V2), 

which is same as the right hand side of (2.5). Similarly, one can show that P[V^(c)] has the 
same asymptotic behavior as in the case of Xn- Thus the results stated for Xn also hold for 

Proposition 2.3 Let t > ^=, and let rnit) = -2M£=. Then with probability 1, dn < Tnit) 

v2 -y/logn 

for all large enough n. 

Proof. Pick u, t such that (2d + c — 2)/2-v/2 < u < t, and e > satisfying 

e + u <t. 



Let c > 2. From Lemma 2.2, Xn C B{0, Rn{c)) a.s. for all large enough n. For m = 1,2,..., 

let v{m) = a™, for some a > 1. Let Km, (the covering number) be the minimum number 

of balls of radius ry(^\{e) required to cover the ball 5(0, i?;y(m_|_i)(c)). From Lemma 2.1, 

Penrose (1999), we have 

n^^i^X. (2.8) 

Vlog mj 

Consider the deterministic set {x ]",... ,x™^} G 5(0, i?i^(m_|_i)(c)), such that i?(0,i?,^(m+i)(c)) C 
U^-,i?(xr,r,(m)(e)). 

Given x G 3*?^, let Am{x) denote the annulus i?(x,r^(m_(_i)(u))\i3(x,r^(m)(£))) ^'^^ l^t Fm{x) 
be the event such that no vertex of Xj^tm) li^s in Am{x), i.e. 

Fm(x) = {A',(^)[^m(x)]=0} (2.9) 

Since, 



P[XieAUx)] = f (l){y)dy 



(X) 

> / Hy) dy 

= HRu{m+l){c),ri,(^m){u))-I{Ru{m+l){c),r^(^rn){(^)) (2-10) 

from Lemma 2.1, we get 

P[X^ G A„(x)] ^ Qe-^'('"+i)(^)/'(i?,(m+i)(c))-^. 

. (e^''('-+i)(^)'^''(-)(")(r^(„)(u))'^ -e^''('-+i)(^)''''('-)(^)(r^(m)(e))^) := g^. 

Substituting the values of Ru{m+i)i'^) ^^^ '''u{m){') ^^ Qm, we get 

(log m)('^-i)/2 



Qm+l^d+f-l-«v^ 



(2.11) 



Hence, 



P[F„,{x)] ^ (1 - gm)'^(-) < exp(-z.(m)gm) ~ exp |-C^J^||^|^| , (2.12) 

where C is some constant. Set Gm = U^Z!\Fm(x™). 

< /^ m X'^ I „(log m)^'^^-'^)/^ ' 



^p-i ~ i.i^j -"H-^ ,,;.,--.^ ■ <"^> 



which is summable in m for all u > {2d + c — 2)/2v2- By Borel-Cantelli, Gm occurs only 
for finitely many m a.s. 

Pick n,and take m such that a™ < n < a™+^. If dn > fn{t)^ then there exists an X G A:'„ 
such that Xn{B{X,rn{t)) \ {X]) = 0. Also note that X wih be in S(0, i?^(„+i)(c)) for ah 
large enough n, so there is some i < Km such that X G i?(a;™,r^(-„)(e)). So, if m is large 
enough, 

r,y(m)ie) +r^^rn+i)iu) < r^(m+i)(*) < rn{t). 

So, Fmixi{m)) and hence Gm occur. Since Gm occurs finitely often a.s., dn < T„(t) for all 
large n, a.s. The result now follows since e > and c > 2 are arbitrary. 

Now we derive a lower bound for dn- Let Vnit) = °^^" . 

Proposition 2.4 Lei t < -7=. Then with probability 1, dn > rn{t), eventually. 

Proof. We prove the above proposition using the Poissonization technique, which uses 
the following Lemma (see Lemma 1.4, Penrose (2003)). 

Lemma 2.5 Let N{X) be Poisson random variables with mean A. Then there exists a con- 
stant ci such that for all A > Ai, 

P[X>X + A3/V2] < ci exp(-Ai/2)^ 

and 

P[X <X- A3/V2] < ci exp(-Ai/2)^ 



Enlarging the probability space, assume that for each n there exist Poisson variables N{n) 
and M{n) with means n — v?''^ and Iv?''^ respectively, independent of each other and of 
{Xi, X2, . . .}. Define the point processes 



Then, "P" and V^ are Poisson point processes on 3f? with intensity functions (n — n^' )0(-) 
and (n + n^/^)0(-) respectively. The point processes "P", Vn and A'„ are coupled in such a 
way that V^ C "P^. Thus, if Hn = {V^ C A'„ C P^}, then by the Borel-Cantelli Lemma 
and Lemma 2.5, P[H!l^ i.o. ] = 0. Hence {P~ C A'„ C P^} a.s. for all large enough n. 

Pick numbers {2d + c- 2)/2\/2 < t < u. Let e > satisfy e + 1 < u. 

Consider the annulus An = B{0,Rn{c) \ B{0,R'^), where Rn{c) is as defined in (2.3) and 
R'n = Rni—i)- For each n, choose a non-random set {x",a;2, . . . ,x^ } C yl„, such that the 
balls B(xf, rn{u)), 1 < i < 0"„ are disjoint. The packing number cj„ is the maximum number 
of disjoint balls B(x,rniu), with x £ An- 

> Ri{c)-R'J /logn^-i 



^n(«) Vlog2« 

Let (i° be the LNND of the points of Xn that fall in An- By Lemma 2.2, there will be points 
in An for all large enough n, a.s. For any point process X and any B C 3?'^, let ^[i3] be the 
number of X in i?. Let En{x) be the event such that 

En{x)={V-[B{x,rn{e))] = 1} n {P+[i?(x,r„(n))] = 1}, 

where x G B{0,Rn{c)) \ B{0,Rn). Set X„ = P^ \ P^, and set Un{x) = B{x,rn{€)), and 
Ki(2;) = B{x,rn{u))\Un{x), then for each n and x the random variables P'^(ffi),P~(V„),X„(V„) 
andX„(f/„), are independent Poissons, and En{x) is the event that the first of these variables 
is 1 and the others are zero. Thus, 

P[En] = {n-j?/^)j (l){y)dyexpl-{n + n^/*) f Hy)dy] 

JB{x,r„{e)) \ JB{x,r„{u)) J 

= (n - n^/^)/(x, r„(e)) exp (-(n + n^/^)/(x, rn{u))^ 

> {n-n^/^)I{R'^,rn{e))exp[-{n + n^/^)I{Rn{c),rn{u))) 

~ Cduriie) exp«r„(e) - <V2)«r„(e))-^ 

• exp ( -C^nrf (n) exp(i?„(c)r„(ii) - i?,^,(c)/2)(i?„(c)r„(u)) 



d-l 



(log2 ") ^ / Ctj2 4 (nlog2 n) 

(log n)'i-2-^v^ "^""^ V (log n)'>'+^/2-i-«v^ 

(log2 n)^~ 
(log n)<i-2-^V2' 



(2.15) 



where the last relation follows since u < '^% . 

2v 2 

If Hn and En{x) happen, then there is a point X G Xn in B{0,Rn{c)) \ B{0,R[^) with no 
other point of Xn in B{x,rn{u)). Therefore 



K<r„}cF;^u U^n( 



(2.16) 



Vj=l 



From above, PlH^] is summable. We will show that P[U'^Il-^En{x'^)] is summable. Since 
by Lemma 2.2, there are points of Xn in An infinitely often, a.s., we conclude that d^ and 
hence d„ will be greater than rn{t) infinitely often a.s. 

The events En{xf), 1 < i < cJn are independent, so by (2.15), for large enough n, 



[JEn( 



L \i=l 



< Y[exp{-P[En{x^)]) 



i=l 



(log2 n)- 



""(logn)rf-2- 


-eV2j 




^log n^-^ 


(log2 n)- 


-1 

2 


\log2 nj 


(log n^-^- 


-eV2 


(log n)^^+i 


:\ 





< exp — Cicr 
~ exp —C2 

= exp(^-C2^^^^^^^(,_^)/,^, 

which is summable in n. Ci and C2 are some constants. This completes the proof of 
Proposition 2.4. 

Theorem 1.1 now follows from Propositions 2.3 and 2.4. 
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